
Conversation

@talili-rsec

@talili-rsec talili-rsec commented Nov 10, 2025

  • The Jira issue number for this PR is: MDEV-34327
  • The Jira issue number for this PR is: MDEV-34330
  • The Jira issue number for this PR is: MDEV-34331
  • The Jira issue number for this PR is: MDEV-19635

Description

MDEV-34327

This patch is the implementation of Oracle’s DBMS_RANDOM package for sql_mode=ORACLE. The random number generator can be seeded with a constant value by calling the INITIALIZE procedure with the seed number.

The following functions were added for the ticket:

INITIALIZE - Initialize the seed
VALUE - Get a random number greater than or equal to 0 and less than 1, with 38 digits to the right of the decimal point (38-digit precision)
RANDOM - Generate a random number
TERMINATE – Terminate the package
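
A minimal usage sketch (the Oracle-style CALL/SELECT syntax under sql_mode=ORACLE is assumed here; exact output values depend on the generator and the seed):

```sql
SET sql_mode=ORACLE;

-- Seed the generator with a constant for repeatable results
CALL DBMS_RANDOM.INITIALIZE(42);

-- VALUE: in [0, 1) with 38-digit precision; RANDOM: a random integer
SELECT DBMS_RANDOM.VALUE, DBMS_RANDOM.RANDOM;

-- Release the package state
CALL DBMS_RANDOM.TERMINATE;
```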

MDEV-34330

This patch is the implementation of Oracle’s DBMS_UTILITY package for sql_mode=ORACLE.

The following functions were added for the ticket:

GET_TIME - Return a measure of current time in hundredths of a second
FORMAT_ERROR_BACKTRACE - Display the call stack at the point where an exception was raised, even if the procedure is called from an exception handler in an outer scope
FORMAT_ERROR_STACK - Format the current error stack

For the FORMAT_XXXX functions, the line numbers reported for SQL statements in package routines are based on the line numbers shown by the "SHOW CREATE" command for the package body.
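
A sketch of the intended use (illustrative names; assumes Oracle-style exception handlers are accepted and that the formatted text can be returned through a result set):

```sql
SET sql_mode=ORACLE;
DELIMITER /
CREATE OR REPLACE PROCEDURE p_fail AS
  msg VARCHAR(2048);
BEGIN
  SIGNAL SQLSTATE '45000' SET MESSAGE_TEXT = 'boom';
EXCEPTION
  WHEN OTHERS THEN
    -- Line numbers in both strings follow SHOW CREATE for the package body
    msg := DBMS_UTILITY.FORMAT_ERROR_STACK ||
           DBMS_UTILITY.FORMAT_ERROR_BACKTRACE;
    SELECT msg;
END;
/
DELIMITER ;
```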

TODOs:
The error stack formatting differs from Oracle's implementation. In Oracle, FORMAT_ERROR_STACK and FORMAT_ERROR_BACKTRACE can display output from any point in the call stack (bottom, middle, or top). This implementation always formats output as if the request originates from the bottom-most function in the stack, regardless of where the function is actually called from.

MDEV-34331

This patch is the implementation of Oracle’s UTL_I18N package for sql_mode=ORACLE. The package is a set of services that provides additional globalization functionality.

The ANY_CS character set was also implemented so that the character set established by the "set names" command, or the character set of the database, is used as the character set of a variable declared with ANY_CS; this matters especially for the TRANSLITERATE function.

The following functions were added for the ticket:

TRANSLITERATE - Perform script transliteration
RAW_TO_CHAR - Convert RAW data from a valid character set to a VARCHAR2 string in the database character set
STRING_TO_RAW - Convert a VARCHAR2 string to another valid character set and return the result as RAW data
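
An illustrative round trip (the character-set names and the transliteration spec below are assumptions; consult the package tests for the exact accepted values):

```sql
SET sql_mode=ORACLE;

-- VARCHAR2 -> RAW in an explicit character set
SELECT UTL_I18N.STRING_TO_RAW('abc', 'utf8mb4') FROM dual;

-- RAW -> VARCHAR2, interpreting the bytes as latin1
SELECT UTL_I18N.RAW_TO_CHAR(x'616263', 'latin1') FROM dual;

-- Script transliteration, e.g. katakana to hiragana
SELECT UTL_I18N.TRANSLITERATE('カタカナ', 'kana') FROM dual;
```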

MDEV-19635

This patch adds support for dynamic SQL to parse any data manipulation language (DML) or data definition language (DDL) statement using PL/SQL.

The following functions were added for the ticket:

OPEN_CURSOR - Open a new cursor
PARSE - Parse the given statement in the given cursor
EXECUTE - Execute a given cursor
CLOSE_CURSOR - Close the cursor when no longer needed for a session

TODOs:
Currently, the only accepted value for the third argument (language_flag) of the PARSE procedure is 1, corresponding to Oracle's NATIVE DBMS_SQL constant.
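
The four calls compose as follows (a minimal sketch; assumes anonymous PL/SQL blocks are available under sql_mode=ORACLE, otherwise wrap the calls in a stored procedure):

```sql
SET sql_mode=ORACLE;
DELIMITER /
DECLARE
  c INT;
  res INT;
BEGIN
  c := DBMS_SQL.OPEN_CURSOR;
  -- language_flag must be 1 (Oracle's NATIVE), the only accepted value
  DBMS_SQL.PARSE(c, 'CREATE TABLE t_demo (a INT)', 1);
  res := DBMS_SQL.EXECUTE(c);
  DBMS_SQL.CLOSE_CURSOR(c);
END;
/
DELIMITER ;
```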

Release Notes

  • MDEV-34327: N/A
  • MDEV-34330: N/A
  • MDEV-34331: N/A
  • MDEV-19635: N/A, but new kb page should be created for this dynamic SQL.

How can this PR be tested?

  • MDEV-34327:
    • mysql-test/mtr pkg_dbms_random
  • MDEV-34330:
    • mysql-test/mtr pkg_dbms_utility
  • MDEV-34331:
    • mysql-test/mtr pkg_utl_i18n
    • mysql-test/mtr pkg_utl_i18n-sjis
    • mysql-test/mtr pkg_utl_i18n-ujis
    • mysql-test/mtr pkg_utl_i18n-non_xliterate
  • MDEV-19635:
    • mysql-test/mtr pkg_dbms_sql

If the changes are not amenable to automated testing, please explain why not and carefully describe how to test manually.

Basing the PR against the correct MariaDB version

  • This is a new feature or a refactoring, and the PR is based against the main branch.
  • This is a bug fix, and the PR is based against the earliest maintained branch in which the bug can be reproduced.

PR quality check

  • I checked the CODING_STANDARDS.md file and my PR conforms to this where appropriate.
  • For any trivial modifications to the PR, I am ok with the reviewer making the changes themselves.

@CLAassistant

CLAassistant commented Nov 10, 2025

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
7 out of 12 committers have signed the CLA.

✅ bsrikanth-mariadb
✅ janlindstrom
✅ grooverdan
✅ talili-rsec
✅ mariadb-OlegSmirnov
✅ spetrunia
✅ eworm-de
❌ mariadb-YuchenPei
❌ dr-m
❌ vuvova
❌ montywi
❌ Thirunarayanan

@svoj svoj added the External Contribution All PRs from entities outside of MariaDB Foundation, Corporation, Codership agreements. label Nov 10, 2025
@talili-rsec talili-rsec force-pushed the talili-tickets branch 5 times, most recently from 60531e8 to 2ed0f32 on November 18, 2025 07:24
@talili-rsec talili-rsec force-pushed the talili-tickets branch 3 times, most recently from 3a63e05 to 2484b3b on November 24, 2025 08:41
vuvova and others added 20 commits November 26, 2025 14:20
Ideally spider should throw an error when local and remote tables have
conflicting definitions.

This fixes the test for --view-protocol and
--mysqld=--loose-spider-disable-group-by-handler
…ider_db_result in spider_db_store_result

This prevents segv on NULL result_list->current->result on handler
calls that are not the first. Reasons supporting this change:

- spider_db_result::limit_mode is not called anywhere else. The only
  calls to the limit_mode method are on spider_db_conn
- it is very unlikely (impossible?) for the connection to change from
  one backend to another between execution of sql statements and
  storing result:

/* in spider_bg_conn_action: */
                if (dbton_handler->execute_sql(
                  sql_type,
                  conn,
                  result_list->quick_mode,
                  &spider->need_mons[conn->link_idx])
// [... 9 lines elided]
                    if (!(result_list->bgs_error =
                      spider_db_store_result(spider, conn->link_idx,
                        result_list->table)))

  this also means it is very unlikely (impossible?) for the backend
  type (dbton_id) to differ between conn->db_conn and
  result_list->current->result, also considering that
  spider_db_result::dbton_id comes from spider_db_conn::dbton_id:

spider_db_result::spider_db_result(
  SPIDER_DB_CONN *in_db_conn
) : db_conn(in_db_conn), dbton_id(in_db_conn->dbton_id)

Since this was the only call to spider_db_result::limit_mode, we also
remove the method altogether.
…pider tests

This also has the side benefits of catching --view-protocol failures
in CI, as --view-protocol often causes spider gbh not to be created
Any InnoDB write workload is marking data pages in the buffer pool dirty.
To make the changes durable, it suffices to write to the write-ahead log
to facilitate crash recovery. The longer we keep pages dirty in the
buffer pool, the better, because the same pages could be modified again
and a single write to the file system could cover several changes.

Eventually, we must write out dirty pages, so that we can advance the
log checkpoint, that is, allow the start of the recovery log to be
discarded. (On a clean shutdown, a full checkpoint to the end of the log
will be made.)

A write workload can be bound by innodb_buffer_pool_size
(we must write out changes and evict data pages to make room for others)
or by innodb_log_file_size
(we must advance the log checkpoint before the tail of the circular
ib_logfile0 would overwrite the previous checkpoint).

In innodb_log_file_size bound workloads, we failed to set an optimal
target for the next checkpoint LSN. If we write out too few pages, then
all writer threads may occasionally be blocked in log_free_check() while
the buf_flush_page_cleaner() thread is resolving the situation. If we
write out too many pages, then the I/O subsystem will be loaded
unnecessarily and there will be some write amplification in case some of
the unnecessarily written pages would be modified soon afterwards.

log_close(): Return the target LSN for buf_flush_ahead(lsn),
bitwise-ORed with the "furious" flag, or the special value 0,
indicating that no flushing is needed, which is the usual case.

log_checkpoint_margin(): Use a similar page checkpoint target as
log_close() for the !not_furious case.

mtr_flush_ahead(): A wrapper for buf_flush_ahead().

mtr_t::commit_log_release(): Make some common code non-inline in order
to reduce code duplication.

buf_flush_ahead(lsn, furious=false): Avoid an unnecessary wake-up of the
page cleaner if it is scheduled to wake up once per second.

Co-developed-by: Alessandro Vetere
Reviewed by: Vladislav Lesin
btr_search_lazy_free(): Check for !table->id. It is possible that
dict_sys_t::remove(table, ...) has started executing but has not
reached the final part yet. If that is the case, let
dict_sys_t::remove() observe that table->freed_indexes is empty
and let it free the table object.

Reviewed by: Thirunarayanan Balathandayuthapani
Tested by: Saahil Alam
The debug parameter innodb_trx_rseg_n_slots_debug is not compatible
with the bug fix that was implemented in MDEV-30671 and last revised
in MDEV-33213, which leads to bogus assertion failures in
trx_assign_rseg_low(), regarding look_for_rollover. When needed,
this parameter can be better simulated by adding a DBUG_EXECUTE_IF to
trx_undo_seg_create() or one of its callers.
…overflow

Problem:
========
- When an R-tree root page becomes full and requires splitting,
InnoDB follows a specific root-raising procedure to maintain
tree integrity. The process involves allocating a new page
(Page X) to hold the current root's content, preserving the
original root page number as the tree's entry point, and
migrating all existing records to Page X.

The root page is then cleared and reconstructed as an
internal node containing a single node pointer with an
MBR that encompasses all spatial objects on Page X.
Subsequently, InnoDB should split the records on Page X
into two spatially optimized groups using the
pick_seeds() and pick_next() algorithms,
creating a second page (Page Y) for Group B records
while retaining Group A records on Page X.

After records are redistributed between Page X and Page Y,
the recalculated MBR for Page X must remain within
or be smaller than the original MBR stored in the
root page's node pointer.

Bug scenario:
============
- When root page 4 becomes full, it triggers a split operation
where the content is copied to page 7 and root page 4 is cleared
to become an internal node.
- During the first split attempt on page 7, Group 1 overflows
and remaining entries are reassigned to Group 2.
- A new page 8 is created and the remaining entry record
is inserted, but the combined size of the remaining entry
record and new record exceeds the page size limit.
- This triggers a second split operation on page 7, where
Group 2 overflows again and entries are moved back to Group 1.
- When the new record is finally inserted into page 7,
it causes the MBR (Minimum Bounding Rectangle) for page 7
to expand beyond its original boundaries.
- Subsequently, when InnoDB attempts to update the parent
page 4 with the new MBR information, it fails to locate
the corresponding internal node, leading to spatial
index corruption and the reported failure.

Root cause:
===========
- The second split operation should happen on page 8, not on page 7.
- split_rtree_node() considers key_size to estimate
record sizes during the splitting algorithm, which fails to
account for variable-length fields in spatial records.
- In rtr_page_split_and_insert(), when reorganization
succeeds, InnoDB doesn't attempt to insert the entry.

Solution:
========
rtr_page_split_and_insert(): InnoDB should insert the
tuple when btr_page_reorganize() is successful.

rtr_page_split_and_insert(): Use the overflow page
for consecutive split operations.

split_rtree_node(): Store the record length for each
record in the R-tree node. This gives a proper
estimate while determining the group entries and
is also helpful in overflow validation.
rtree_mbr_from_wkb::n_node: Change the data type from int to uint16_t
and pack it together with the added field key_len. In this way, an
increase of sizeof(rtree_mbr_from_wkb) will be avoided.
row_raw_format_str(): Treat the invalid value charset_coll==0 as binary.
This could be invoked on FTS_%_CONFIG.key or SYS_FOREIGN.ID
or possible other key columns.

dtype_is_utf8(): Merge to its only caller row_raw_format_str().

row_raw_format(): Add debug assertions and comments to document
when dtype_get_charset_coll(prtype) may be 0.
Fixed by marking MRG_MYISAM changes explicitly as a DDL
The "FETCH FIRST n ROWS WITH TIES" was not enforced when the SELECT used
"using index for group-by".

This was caused by an optimization which removed the ORDER BY clause
when the GROUP BY clause prescribed a compatible ordering.

Other GROUP BY strategies used workarounds to still handle WITH TIES,
see comment to using_with_ties_and_group_min_max() in this patch for
details. QUICK_GROUP_MIN_MAX_SELECT didn't have a workaround.

Fix this by disabling removal of ORDER BY when
QUICK_GROUP_MIN_MAX_SELECT is used.
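A query shape that exercises this path might look like the following (illustrative; the loose index scan is chosen when an index covers both the GROUP BY column and the MIN/MAX argument):

```sql
CREATE TABLE t1 (a INT, b INT, KEY(a,b));

-- "Using index for group-by" can serve this GROUP BY; WITH TIES must
-- still be enforced on the ORDER BY result
SELECT a, MIN(b) AS mb
FROM t1
GROUP BY a
ORDER BY mb
FETCH FIRST 2 ROWS WITH TIES;
```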
…EP_START_POSITION

MDEV-35904 - backport MDEV-19210 to 10.11 as referenced
by unset environment variables become warnings.

We used to run `systemctl set-environment` to pass
_WSREP_START_POSITION. This is bad because:

* it clutters systemd's environment (yes, pid 1)
* it requires root privileges
* options (like LimitNOFILE=) are not applied

Let's just create an environment file in ExecStartPre=, that is read
before ExecStart= kicks in. We have _WSREP_START_POSITION around for the
main process without any downsides.
…nd collation associated with views, etc

In Query_log_event::do_apply_event, the character set used in the
event is compared with the cached charset, and if they differ, the
charset in use is changed. Unfortunately, this was done only if the
thread is a replica thread.

Fixed by adding a condition for the Galera applier thread so that
the comparison is done, leading to a charset change if the event
had a different charset.
The Debian library dependencies are auto-detected and populated
as part of shlibs:Depends, and the explicit population
is susceptible to distro packaging changes.

Also fix the server gssapi plugin's dependency on the compiled
server, making it consistent with other server plugins.
The problem was that thd->lex->m_sql_cmd is not always set,
especially when the user has not provided ENGINE=xxx, so
requesting option_storage_engine_name from there is not safe.

Fixed by accessing thd->lex->m_sql_cmd only when the user
has used ENGINE=, and otherwise using ha_default_handlerton
and requesting the engine name from it.
check for thd->killed in the lexer
…table upon INSERT DELAYED

sequences, just as triggers or check constraints, disable DELAYED
vuvova and others added 9 commits November 26, 2025 14:40
If a bunch of tables are prelocked, when a table is actually
needed in the routine, open_tables picks one table out of the
prelocked list with the smallest "distance". Distance is simply
the difference between the actual table lock and the requested
table lock. Say, if the prelocked set contains both t1 write-locked
and t1 read-locked, then an UPDATE will prefer the write-locked t1
and a SELECT will prefer the read-locked one. If there's only a
write-locked table in the set, both UPDATE and SELECT will use it.

This doesn't distinguish between UPDATE and INSERT, but INSERT
marks tables with tables->for_insert_data=1, which causes
prelocking to invoke add_internal_tables() and prepare sequences
for execution.

In this bug there were two prelocked t1's, one for INSERT (with
for_insert_data=1) and one for UPDATE.
INSERT picks the second (both are write-locked, so the distance is
the same); its sequence is not prepared, and it crashes.

Let's add for_insert_data as the lowest bit into the distance.
Test case only. The bug is fixed by cherry-pick of 2d5db53
"Simplify NEXTVAL(sequence) when used with DEFAULT"
remove the fix for MDEV-25243. It's no longer needed, because
a routine can no longer be re-parsed in the middle of a statement.
…re than 100 ORs

(Variant 2)
SEL_TREE* tree_or(SEL_TREE *X, SEL_TREE *Y) tries to conserve memory by
reusing object *X for the return value when possible.

MDEV-34620 has added logic to disable construction of index_merge plans
for N-way ORs with large N. That logic interfered with object reuse logic:

for the parameters of:
X = SEL_TREE{ trees=[key1_treeA, key2_treeB]}
Y = SEL_TREE{ trees=[key1_treeC]}

we would decide to reuse object X.
For key1, we would produce key_or(key1_treeA, key1_treeC)
but key2_treeB would be just left there.
Then, we would construct a range scan from key2_treeB.

Fixed by moving the "disable building index_merge plans" logic into a
location where it would not interfere with object reuse logic.
* remove the file to be --repeat friendly
* specify the correct defaults-group-suffix. Spider doesn't use the
  conventional .1 group, so MYSQLD_CMD skips a lot of config options,
  in particular it doesn't read --tmpdir and creates files in /tmp
MDEV-37726 moved wsrep-start-position to INSTALL_RUNDATADIR
and made the latter be created by systemd-tmpfiles.

Now the postin scriptlet has to run systemd-tmpfiles explicitly
to make sure INSTALL_RUNDATADIR exists before restarting
the server.

followup for 649216e
table->pos_in_table_list is problematic: it can become
dangling when an SP is reparsed. Let's avoid it and directly
prefer TABLEs where a sequence is ready for inserts.

followup for 516f68a
…lumns

(Preparation for the main patch)

set_statistics_for_table() incorrectly treated indexes with all NULL
values the same as indexes with no statistics, because avg_frequency
is 0 in both cases. This caused the optimizer to ignore valid EITS
data and fall back to engine statistics.

Additionally, KEY::actual_rec_per_key() would fall back to engine
statistics even when EITS was available, and used incorrect pointer
comparison (rec_per_key == 0 instead of nullptr).

Fix by adding Index_statistics::stats_were_read flag to track per-index
whether statistics were actually read from persistent tables, and
restructuring actual_rec_per_key() to prioritize EITS when available.
…olumns

When all values in an indexed column are NULL, EITS statistics show
avg_frequency == 0. This commit adds logic to distinguish between
"no statistics available" and "all values are NULL" scenarios.

For NULL-rejecting conditions (e.g., t1.col = t2.col), when statistics
confirm all indexed values are NULL, the optimizer can now return a
very low cardinality estimate (1.0) instead of unknown (0.0), since
NULL = NULL never matches.

For non-NULL-rejecting conditions (e.g., t1.col <=> t2.col),
normal cardinality estimation continues to apply since matches are possible.

Changes:
- Added KEY::rec_per_key_null_aware() to check nulls_ratio from column
  statistics when avg_frequency is 0
- Modified best_access_path() in sql_select.cc to use the new
  rec_per_key_null_aware() method for ref access cost estimation
- The optimization works with single-column and composite indexes,
  checking each key part's NULL-rejecting status via notnull_part bitmap
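
The two condition kinds can be contrasted as follows (illustrative; EITS must be collected for the estimate to apply):

```sql
CREATE TABLE t1 (col INT, KEY(col));
INSERT INTO t1 VALUES (NULL), (NULL), (NULL);

-- Collect engine-independent table statistics (EITS)
ANALYZE TABLE t1 PERSISTENT FOR ALL;

-- NULL-rejecting: NULL = NULL never matches, so a ~1 row estimate is safe
EXPLAIN SELECT * FROM t1 x JOIN t1 y ON x.col = y.col;

-- Not NULL-rejecting: NULL <=> NULL matches, so normal estimation applies
EXPLAIN SELECT * FROM t1 x JOIN t1 y ON x.col <=> y.col;
```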
@talili-rsec talili-rsec force-pushed the talili-tickets branch 3 times, most recently from 95c82b9 to 3bbd4ad on November 27, 2025 07:58
  - MDEV-34327: System package SYS.DBMS_RANDOM

Added the following procedures:
- INITIALIZE - Initialize the seed
- TERMINATE - Terminate the package

Added the following functions:
- VALUE - Get a random number greater than or equal to 0 and less than 1,
with 38 digits to the right of the decimal point (38-digit precision)
- RANDOM - Generate a random number

The random number generator can be seeded with a constant value by calling
the INITIALIZE procedure with the seed number.
  - MDEV-34330: System package SYS.DBMS_UTILITY

Added the following functions:
- GET_TIME - Return a measure of current time in hundredths of a second
- FORMAT_ERROR_BACKTRACE - Display the call stack at the point where an
exception was raised, even if the procedure is called from an exception
handler in an outer scope
- FORMAT_ERROR_STACK - Format the current error stack

The line numbers reported for SQL statements in package routines are
based on the line numbers shown by the "SHOW CREATE" command for
the package body.

Known limitation:
Unlike Oracle, where output can be produced as if requested from any
function in the call stack (bottom, middle, or top), this implementation
always formats output as if the request comes from the stack's
bottom-most function.
 - MDEV-34331: System package SYS.UTL_I18N

The package is a set of services that provides additional globalization
functionality.

The ANY_CS character set was also implemented so that the character set
established by the "set names" command, or the character set of the
database, is used as the character set of a variable declared with
ANY_CS; this matters especially for the TRANSLITERATE function.

Added the following functions:
- TRANSLITERATE - Perform script transliteration
- RAW_TO_CHAR - Convert RAW data from a valid character set to a VARCHAR2
string in the database character set
- STRING_TO_RAW - Convert a VARCHAR2 string to another valid character set
and return the result as RAW data
 - MDEV-19635: System package SYS.DBMS_SQL

This patch adds support for dynamic SQL to parse any data manipulation
language (DML) or data definition language (DDL) statement using PL/SQL.

Added the following functions:
- OPEN_CURSOR - Open a new cursor
- EXECUTE - Execute a given cursor

Added the following procedures:
- PARSE - Parse the given statement in the given cursor
- CLOSE_CURSOR - Close the cursor when no longer needed for a session

Currently, the only accepted value for the third argument
(language_flag) of the PARSE procedure is 1, corresponding to Oracle's
NATIVE DBMS_SQL constant.